PERF: perform reductions block-wise #29847

jbrockmendel · 2019-11-26T00:58:43Z

No description provided.

…rf-reduce

TomAugspurger · 2019-12-05T20:50:30Z

pandas/core/internals/managers.py

+        for blk in self.blocks:
+            bres = func(blk.values, *args, **kwargs)
+            if np.ndim(bres) == 0 and blk.shape[0] != 1:
+                # i.e. we reduced over all axes and not just one; re-do column-wise


I don't quite understand this case. How do we get here? re-calling func doesn't seem ideal.

IIRC it was when we have axis=None

after some digging, this appears to be coming from nanops funcs that are either not getting axis passed or are not handling it correctly.

updated to make this check unnecessary

…rf-reduce

pep8speaks · 2019-12-22T20:40:41Z

Hello @jbrockmendel! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2019-12-27 20:20:58 UTC

jbrockmendel · 2019-12-22T20:42:45Z

Consolidated the conditions under which we go through the block-wise path. In a follow-up, this will allow for significant cleanup of the non-blockwise path (i.e. existing _reduce code) because the set of cases it has to handle is cut down.

jbrockmendel · 2019-12-25T00:05:49Z

i hypothesize that after this and #30416 we'll be able to get rid of quite a bit of the existing special case handling in DataFrame._reduce.

…erf-reduce

…rf-reduce

jbrockmendel · 2019-12-27T20:52:41Z

docbuild failure looks unrelated

jbrockmendel · 2019-12-30T16:56:46Z

@jreback @TomAugspurger ideally id like to get this and #29941 in before the RC.

TomAugspurger

Changes look nice. Any user-facing changes that need a whatsnew?

Do you think we have sufficient test coverage here?

TomAugspurger · 2019-12-30T17:21:25Z

pandas/core/frame.py

+            # After possibly _get_data and transposing, we are now in the
+            #  simple case where we can use BlockManager._reduce
+            res = df._data.reduce(op, axis=1, skipna=skipna, **kwds)
+            assert isinstance(res, dict)


Planning to keep these asserts in?

jbrockmendel · 2019-12-30T17:29:10Z

Any user-facing changes that need a whatsnew?

None that I've identified.

Do you think we have sufficient test coverage here?

It seems pretty solid.

Planning to keep these asserts in?

Fine by me either way.

jreback · 2019-12-30T18:13:22Z

ok by me (assuming you are going to back and de-gross / consolidate this code at some point).

jbrockmendel · 2019-12-30T18:18:28Z

assuming you are going to back and de-gross / consolidate this code at some point

The code added here is is pretty de-grossed. Some of the assertions could be removed to make it less verbose, but unless we get 2D EAs (which is a windmill ive mostly stopped tilting at), this is about as good as it gets.

That said, inside DataFrame._reduce below the code introduced here is a bunch of really gross code that we can clean up after this.

jreback · 2020-01-01T17:18:23Z

k thanks

jbrockmendel added 7 commits November 24, 2019 18:38

REF: implement DataFrame reductions blockwsie

d1d07ff

handle axis==1 with numeric_only

9469400

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

038697a

…rf-reduce

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

94a2ee1

…rf-reduce

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

9e21d77

…rf-reduce

clean up assertions

237253a

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

8757c8a

…rf-reduce

jreback added Performance Memory or execution speed performance Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Dec 1, 2019

TomAugspurger reviewed Dec 5, 2019

View reviewed changes

jbrockmendel added 4 commits December 8, 2019 11:13

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

a1b653d

…rf-reduce

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

f8c3d24

…rf-reduce

consolidate+simplify

ebb33c1

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

c0eb05c

…rf-reduce

revert file not intended

4a16663

jbrockmendel added 2 commits December 26, 2019 12:58

'Merge branch 'master' of https://github.com/pandas-dev/pandas into p…

e4c0466

…erf-reduce

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

3e7da1e

…rf-reduce

jbrockmendel mentioned this pull request Dec 27, 2019

ENH: support datetime64, datetime64tz in nanops.mean, nanops.median #29941

Merged

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

9370b1a

…rf-reduce

jbrockmendel mentioned this pull request Dec 27, 2019

REF: implement cumulative ops block-wise #29872

Merged

5 tasks

TomAugspurger reviewed Dec 30, 2019

View reviewed changes

jreback added this to the 1.0 milestone Dec 30, 2019

jreback merged commit 0aa48f7 into pandas-dev:master Jan 1, 2020

jbrockmendel mentioned this pull request Jan 1, 2020

BUG: Series.any() and .all() don't return bool values if dtype=object #30416

Closed

5 tasks

jbrockmendel deleted the perf-reduce branch January 1, 2020 23:33

hweecat pushed a commit to hweecat/pandas that referenced this pull request Jan 1, 2020

PERF: perform reductions block-wise (pandas-dev#29847)

d7ff4e6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PERF: perform reductions block-wise #29847

PERF: perform reductions block-wise #29847

jbrockmendel commented Nov 26, 2019 •

edited

Loading

TomAugspurger Dec 5, 2019

jbrockmendel Dec 5, 2019

jbrockmendel Dec 21, 2019

jbrockmendel Dec 22, 2019

pep8speaks commented Dec 22, 2019 •

edited

Loading

jbrockmendel commented Dec 22, 2019

jbrockmendel commented Dec 25, 2019

jbrockmendel commented Dec 27, 2019

jbrockmendel commented Dec 30, 2019

TomAugspurger left a comment

TomAugspurger Dec 30, 2019

jbrockmendel commented Dec 30, 2019

jreback commented Dec 30, 2019

jbrockmendel commented Dec 30, 2019

jreback commented Jan 1, 2020

PERF: perform reductions block-wise #29847

PERF: perform reductions block-wise #29847

Conversation

jbrockmendel commented Nov 26, 2019 • edited Loading

TomAugspurger Dec 5, 2019

Choose a reason for hiding this comment

jbrockmendel Dec 5, 2019

Choose a reason for hiding this comment

jbrockmendel Dec 21, 2019

Choose a reason for hiding this comment

jbrockmendel Dec 22, 2019

Choose a reason for hiding this comment

pep8speaks commented Dec 22, 2019 • edited Loading

Comment last updated at 2019-12-27 20:20:58 UTC

jbrockmendel commented Dec 22, 2019

jbrockmendel commented Dec 25, 2019

jbrockmendel commented Dec 27, 2019

jbrockmendel commented Dec 30, 2019

TomAugspurger left a comment

Choose a reason for hiding this comment

TomAugspurger Dec 30, 2019

Choose a reason for hiding this comment

jbrockmendel commented Dec 30, 2019

jreback commented Dec 30, 2019

jbrockmendel commented Dec 30, 2019

jreback commented Jan 1, 2020

jbrockmendel commented Nov 26, 2019 •

edited

Loading

pep8speaks commented Dec 22, 2019 •

edited

Loading